Exploiting Natural Language Structures in Software Informal Documentation
Abstract
Communication means, such as issue trackers, mailing lists, Q&A forums, and app reviews, are premier means of collaboration among developers, and between developers and end-users. Analyzing such sources of information is crucial to build recommenders, for example for suggesting experts, re-documenting source code, or transforming user feedback into maintenance and evolution strategies for developers. To ease this analysis, in previous work we proposed Development Emails Content Analyzer (DECA), a tool based on Natural Language Parsing that classifies with high precision development emails' fragments according to their purpose. However, DECA has to be trained through a manual tagging of relevant patterns, which is often effort-intensive, error-prone, and requires specific expertise in natural language parsing. In this paper, we first show, with an empirical study, the extent to which producing rules for identifying such patterns requires effort, depending on the nature and complexity of the patterns. Then, we propose an approach, named Nlp-based softwarE dOcumentation aNalyzer (NEON), that automatically mines such rules, minimizing the manual effort. We assess the performances of NEON in the analysis and classification of mobile app reviews, developer discussions, and issues. NEON simplifies the pattern identification and rule definition processes, allowing savings of more than 70 percent of the time otherwise spent performing such activities manually. Results also show that NEON-generated rules are close to the manually identified ones, achieving comparable recall.
Similar resources
Hypertext Structures and Software Documentation
Software documentation represents a critical resource to the successful functioning of many enterprises. However, because it is static, documentation often fails to meet the needs of the many diverse users who are required to consult it on a regular basis in the course of their daily work. Software documentation is a rich resource that has not been fully exploited. Treatment of software documen...
Applying Hypertext Structures to Software Documentation
Software documentation represents a critical resource to the successful functioning of many enterprises. However, because it is static, documentation often fails to meet the needs of the many diverse users who are required to consult it on a regular basis in the course of their daily work. Software documentation is a rich resource that has not been fully exploited. Treatment of software docume...
Natural language processing for documentation analysis
In view of the increasing interest in ontologies as a source of world knowledge, this deliverable presents different types of ontologies and describes the approach adopted within the Klase project towards the problem of mapping specialized linguistic ontologies to generic resources. It reports on investigations related to the possibility of applying linguistic ontologies to the problem of inter...
Natural Language in Software Engineering
The large time and effort devoted to software maintenance can be reduced by providing software engineers with software tools that automate tedious, error-prone tasks. However, despite the prevalence of tools such as IDEs, which automatically provide program information and automated support to the developer, there is considerable room for improvement in the existing software tools. The authors’...
Exploiting Parse Structures for Native Language Identification
Attempts to profile authors according to their characteristics extracted from textual data, including native language, have drawn attention in recent years, via various machine learning approaches utilising mostly lexical features. Drawing on the idea of contrastive analysis, which postulates that syntactic errors in a text are to some extent influenced by the native language of an author, this...
Journal
Journal title: IEEE Transactions on Software Engineering
Year: 2021
ISSN: 0098-5589, 1939-3520, 2326-3881
DOI: https://doi.org/10.1109/tse.2019.2930519